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Abstract 

We use bias-reduced estimators of high quantiles, of heavy-tailed distributions, to 
introduce a new estimator of the mean in the case of infinite second moment. The 
asymptotic normality of the proposed estimator is established and checked, in a 
simulation study, by four of the most popular goodness-of-fit tests for different 
sample sizes. Moreover, we compare, in terms of bias and mean squared error, our 
estimator with Peng's estimator (Peng, 2001) and we evaluate the accuracy of some 
resulting confidence intervals. 
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1 Introduction 



Let Xi,X 2 ,... be independent and identically distributed (i.i.d.) non-negative random 
variables (r.v.'s) with mean // < oo, variance a 2 and cumulative distribution function (cdf) 
F. Suppose that the tail of F is regularly varying at infinity with tail index (—a) < 0, 
that is 

1 - F (tx) 



lim 

t^oo I- F{t) 



X 



for any x > 0, 



1.1 



(see, e.g., de Haan and Ferreria, (2006), page 19). Such cdf's constitute a major subclass 
of the family of heavy-tailed distributions. It includes distributions such as Pareto, Burr, 
Student, a— stable (0 < a < 2) , and log-gamma, which are known to be appropriate 
models for fitting large insurance claims, large fluctuations of prices, log-returns, etc. 



(see, e.g. |Reiss and Thomas, (2007)] |Beirlant et al. (2001)[ |Rolski et al. (1999)D - In this 
paper, we are concerned with the construction of a bias-reduced asymptotically normal 
estimator for the mean 



xdF (x) 



which could be rewritten, in terms of the quantile function (corresponding to the cdf F) 



Q(s) := inf {x : F (x) > s} , < s < 1, 



as 



Q (1 - s) ds. 



1.2) 



For a given sample X 1? ...,X n , let 



Q n (s) := inf {x G R : F n (x) > s} , < s < 1, 

denote the sample quantile function (classical non-parametric estimator of Q) associated 
to the empirical cdf defined on the real line by F n (x) := rT 1 Y^i=x ^ — x ) > with I (•) 
being the indicator function. The natural (unbiased) estimator of /i is the sample mean 
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/ Q n {l-s)ds = -J2Xi=:X n . 
J o n i= l 



1.3) 



From the Central Limit Theorem (CLT), the sequence of r.v.'s {\/n CX n — /x) /a, n > l} 
converges in distribution to the standard Gaussian r.v., provided that the second-order 



moment E [Xf] is finite. This is a very restrictive condition in the context of heavy-tailed 
distributions as the following considerations show. Assume that the r.v. X\ follows the 
Pareto law with index a > 0, that is, 1 — F(x) = x~ a for x > 1. When a > 1, the mean 
fi exists, but E [Xf] is only finite for a > 2. Hence, the range a G (1,2) is not covered 
by the CLT and thus we need to seek another approach to handle this situation. Making 



use of Weissman's estimator of high quantiles Weissman, (1978), Peng, (2001) proposed 



an alternative estimator for /i and established its asymptotic normality for any a G (1, 2). 
Let us define the following estimator for Q : 



Qn(l-s) :-- 




for < s < k/n 
for k/n < s < 1, 



where 



Q^(l - s) := {k/nf^X^s 
is Weissman's estimator of high quantiles, with 



s | 



-i 



a 



k 1 ^ log X n _ i+hn - log X. 



n—k,n 



;i.s) 



8=1 



being the well-known Hill estimator Hill, (1975) of the tail index a, and Xi „ < ... < X n>n 
denoting the order statistics pertaining to the sample Xi, ...,X n . The number k represents 
the number of upper order statistics used in the computation of a^, it is an integer 
sequence k = k n satisfying 



1 < k < n, k — > oo and k/n — > as n — > oo. 



1.6) 



By replacing Q (1 — s) by Q n (l — t) in formula fll.2p . Peng, (2001) proposed an alternative 
estimator for /i as follows: 



>1 i-k/n 

fin=fin(k)'-= I Qn0--s)ds= / (1 - s) ds + / Q n (l - s)ds, 

JO JO Jk/n 

which, by a straightforward calculation, is equal to 

1 - 

n~k,n / , X n ... j_. J n . 

n t—^ 



p _ k gg 



n — 1 



;i-7) 



i=k+l 



provided that > 1. Moreover, the same author showed that, under suitable regularity 
assumptions, for any a G (1,2), 



where 



7=T^ > N (0,(7 (a)) , as n ->■ c», 

a 2 (a) :=a/(l-a) 4 (2-a). 



1.8) 



Throughout this paper, the standard notations A, — )■ and = respectively stand for con- 
vergence in probability, convergence in distribution and equality in distribution, while 
Af(a,b 2 ) denotes the normal distribution with mean a and variance b 2 . 



Actually, Peng, (2001) defined his estimator in the more general situation where the r.v. X 
is real (not necessarily non-negative) with lower and upper heavy tails. He simultaneously 
took into account the regular variations of both tails of G and the balance condition 

ton (1 - F (t)) I (1 - F (t) - F (-*)) = p E [0, 1] . 

t— >OQ 

In this paper, we only consider non-negative r.v.'s. Our motivation comes from the 
actuarial risk theory where insurance losses are represented by such r.v.'s. In this case, 
fin ma y be interpreted as an estimator of a risk measure called the net premium, see for 
instance Necir and Meraghni, (2009) and Brahimi et al. (2011) Note that in our case, 



since r.v. X is non-negative, we have F (— x) = for x > 0, which yields p — 1 in the 
above balance condition. 

Hill's estimator a% plays a pivotal role in statistical inference on distribution tails. This 
estimator has been thoroughly studied, improved and even generalized to any real pa- 



rameter a. Weak consistency of was established by Mason, (1982) assuming only that 



the underlying cdf F satisfies condition ( II. ip . The asymptotic normality of has been 



established (see de Haan and Peng, (1998)) under the following stricter condition that 



characterizes Hall's model (see Hall, (1982) and Hall and Welsh, (1985)). 

1 — F (x) = cx~ a + dx' 13 + o (x~ p ) , as x ->■ oo, (1.9) 
for some c > 0, d ^ and /3 > a > 0. Note that (II. 9p . which is a special case of a more 



general second-order regular variation condition (see de Haan and Stadtmuller, (1996)) 



is equivalent to 



Q (1 - s ) = c l l a s- l l a (1 + a- l C^ a ds p/a - 1 + o (1)) , as s | 0. 



1.10) 



The constants a and ft are called, respectively, first-order (tail index, shape parameter) 
and second-order parameters of cdf F. 

Extreme value based estimators essentially rely on the number k of upper order statis- 
tics involved in estimate computation. Hill's estimator has, in general, a substantial 
variance for small values of k and a considerable bias for large values of k. Hence, one 
has to look for a k value, denoted by k*, that balances between these two vices. The 
choice of this optimal value k* represents a thorny issue in the process of estimating 
the tail index and related quantities. To solve this problem, several adaptive proce- 



dures are available, see, e.g., Dekkers and de Haan, (1993), Drees and Kaufmann, (1998) 



Danielsson et al. (2001), Cheng and Peng, (2001), Neves and Fraga Alves, (2004), and the 
references therein. A theoretical optimal choice of k is obtained by minimizing the asymp- 
totic mean squared error (RMSE) of . Indeed, under condition (11. 9p . we have (see 



de Haan and Peng, (1998)] ) 

k* := (2- 1 a/3 2 (f3 - a)" 3 cT 2 c 2/3/a ) ^ n*e=* . 



2/3-2o 



;i.n) 



Though Peng's estimator enjoys the asymptotic normality property, it still has a prob- 
lem due to the fact that, it is based on Weissman's estimator known to be largely 
biased. Fortunately, many estimators with reduced biases are proposed in the literature as 



an alternative to , see, for instance, Feureverger and Hall, (1999) , Beirlant et al. (2002) 



Gomes and Martins, (2002) , Gomes and Martins, (2004) , Caeiro et al. (2004) , Caeiro et al. (2009) 



Peng and Qi, (2004) , Matthys et al (2004) , Gomes and Figueiredo, (2006) , Gomes and Pestana, (2007) 



and Beirlant et al. (2008) 



In this paper, we use the bias-reduced estimator of the high quantile Q (1 — s) , recently 



proposed by Li et al. (2010) who exploited the censored maximum likelihood (CML) 



based estimators [a, /3) of the couple of regular variation parameters (a, 0) introduced 



by Peng and Qi, (2004) The CML estimators ( a, f3 ) are defined as the solution of the 



5 



two equations (under the constraint ft > a„ 



k 2- G % (a, ft) an k Gi (a, ft) ° g X n . K 



;i.i2) 



where 



Gi (a,/3) = " (l + -^i/(n 



ft V a- ft' 



A n -4+l,n \ a P 

~x 



n—k,n 



a — ft 



H(a) 



;i.l3) 



with 



»(«> = - -rE* 



X 



re— i+l,n 



i=l 



n— fc,n 



Li et al. (2010) obtained their bias-reduced estimators Q LVY (1 — s) , of the high quantiles 



Q (1 — s) , by substituting (a, ,5) to (a,/3) in fll.lOp . That is 



Q L n PY (1 - s) := c 1/a s- 1/a (l + a^c-^ds^- 1 } , s | 0, (1.14) 



where 



a/3 



a- fin 
a/3 k 



X. 



X n —i+l,n 



n—k,n 



X 



1 1 k 

I I ■ * 



fc.re 



;i.i5) 



-L 1 i ^-re-i+l,re I 



The consistency and asymptotic normality of Qn PY (1 — s) are established by the same 
authors. Now we can define another estimator for the quantile function Q as follows: 

QL pY (1 - s ) for < s < k/n 

Q n (l — s) for k/n < s < 1. 



Qre(l " S) 



By replacing Q by Q n , in formula fll.2j) . we get 

-1 rk/n 



/ire = A*re (fc) := / QniX ~ s)ds 



Q I f Y (l-s)ds+ / Q n (l-s)ds. (1.16) 

U Jfc/re 



An elementary integral calculation leads to a new bias-reduced estimator for \i defined by 
the following formula: 



1/s f a dc-^ 5 jk/n)^- 1 ' 
67-1 p-i 



(2 n := (k/n) (nc/k) 1/S ( ^ + - '■ ™ "J ) + - £ X n _ i+1>n , 



;i.l7) 



i=fc+l 
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provided that (3 > a > 1 so that ju n be finite. 

The rest of the paper is organized as follows. In Section [21 we briefly discuss the third 
order-condition of regular variation before establishing the asymptotic normality of ju n . 
In Section (3J we carry out a simulation study to illustrate the performance of our new 
estimator Ji n and compare it with Peng's one. Proofs are relegated to Section HI Some 
concluding remarks notes made in Section |5j Finally, some of the main results used in 
SectionHJare gathered in the Appendix, as well as a very brief description of the algorithm 
of Reiss and Thomas applied, in Section [3J to select the optimal sample fraction k. 



2 Main results 

In the theory of extremes, a function, denoted by U and (sometimes) called tail quantile 
function, is used quite often. It is defined by 

U (t) := (1/ (1 - F))- 1 {t) = Q (1 - l/t) , 1< t < oo. 

In terms of this function, Hall's conditions ( jl.9p and ( 11. 10ft are equivalent to 

U (t) = c 1/a t 1/a (1 + a- 1 c~ /3/Q rft 1 - /3/Q + o (1)) , t ->• oo. (2.18) 

This implies that 

.. log [U(tx) /U(t)) -a- 1 log x x 1 -?'*-! f n , oin . 

hm " \'J — = -— , for any x > 0, 2.19 

t->°° A\ (t) 1 — p/ a 

where 

A 1 (t) := da- 1 (1 - (5 /a) c - p/a t l - p/a . 

The function A\ it) , which tends to zero as t — > oo (because > a), determines the rate 
of convergence of log [U (tx) /U (t)] to its limit a -1 logx. Relation (12.191) is known as the 
second-order condition of regular variation (see, e.g., de Haan and Ferreria, (2006)fc >age 
43). 

Unfortunately, the second-order regular variation is not sufficient to find asymptotic distri- 
butions for the estimators defined by the systems f II . 1 2 j) and (11.151) . We strengthen it into a 
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condition, called third-order condition of regular variation and given by (12.201) . that speci- 



fies the rate of (I2.19p (see, e.g., de Haan and Stadtmiiller, (1996) or Fraga Alves et al. (2007) ) 



log [U (tx) /U (t)} - a- 1 logs _ x 1 "^" - 1 
Jim Mt) . ... " l ~ P/a = J (a, ftp), (2.20) 

t^oo A 2 (t) 

where A 2 (t) — > as t — > oo, with constant sign near infinity and 

1 / x 1 ^ l3 ^ a+p — 1 x 1- ^" — 1 

D(a,P,p):= 



p\l — /3/a + p l — /3/a 



with p being a positive constant called third-order parameter. Peng and Qi, (2004) es- 
tablished the asymptotic normality of a, (3 and c under the following extract conditions 
on the sample fraction k, as n — > oo, 

(i) Vh\Ai(n/k)\-too, (ii) VkA\{n/k) 0, (m) V^Ai (n/k) A 2 (n/k) -)> 0. 

(2.21) 

As for d, it is asymptotically normal under the assumption \A\ (n/k)\ \ / log (n/k) — > 
oo added to (ii) and (Hi) . 

Example 2.1 Consider the Frechet cdf with shape parameter a > 

F (x) = exp , x > 0. (2.22) 

The corresponding tail quantile function is defined by U (t) = (— log(l — l/t))~ la , for 
t > 1. Applying Taylor's expansion (to the third order) to U and identifying with A2.18\) . 
yield (3 = 2a, c = 1 and d = —1/2. The condition A2.20\) holds for Ax (t) = t~ 1 /2a, 
A 2 (t) = (a — 3) t _2 /12a 2 and p = 3a. Other examples may be found in the recent paper 



Goegebeur and de Wet, (2011). The Frechet cdf will be employed, in Section^ as a model 



in our simulation study. 



Note that, from a theoretical point of view, assumptions ( 11. 6p and ( 12 .211) are realistic, as 
the following example shows. Indeed, let us choose k = [n e ] , < e < 1, then it easy to 
verify that these assumptions hold for any 2/3 < e < 4/5. The notation [•] stands for the 
integer part of real numbers. 
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Our main result, namely the asymptotic normality of the bias-reduced estimator ju n , 
is formulated in the last of the following four theorems. In Theorem I2.1[ we give an 
approximation of a in terms of Brownian bridges, which leads to its asymptotic normality 
stated in Theorem 12 .21 We do the same thing to /2 n in Theorem [231 It is worth mentioning 



that the asymptotic normality of a was first established by Peng and Qi, (2004) But, 
this does not meet our needs to achieve the major object of this paper. Then, we need 
to approximate both a and fi n by linear functional of the same sequence of standard 
Brownian bridges B n (s) . 

Theorem 2.1 Assume that the third order condition 20\) holds with (3 /a =: A > 1 and 
let k = k n be an integer sequence satisfying ( tl.fij) and $2.21\) . Then there exists a sequence 
of Brownian bridges {B n (s) , < s < 1} such that 

Vk(a-a) = a (rjiW ln + r) 2 W 2n + m W 3n) + o p (1) , as n ^ oo 

where W\ n , W 2n and W 3n are sequences of centered Gaussian r.v. ? s defined by 

Win := \fnJk~B n (1 - k/n) - ^/n~Jk [ 8 ~ x B n (1 - ks/n) ds, 

Jo 

W 2n := (A" 1 - 1) \fn~/kB n (1 - k/n) + (A - 1) ^Jk [ s x - 2 B n (1 - ks/n) ds, 

Jo 

W 3n : = (1 - A) v^A / s x - 2 (log s) B n (1 - ks/n) ds 

Jo 



+ \~ 2 ^/kB n (1 - k/n) - ^Jk / s x ~ 2 B n (1 - ks/n) ds, 

Jo 

and 

A 4 A 2 (2A- 1)(3A- 1) A 3 (2A-1) 2 

Theorem 2.2 Under the assumptions of Theorem \2.l\ we have 

Vk (a — a) -4 N (0, a 2 /? 4 / (a - /3) 4 ) , asn^oo. (2.23) 

Theorem 2.3 Under the assumptions of Theorem \2.1\ we have, as n — >■ oo 

n ct 

{fi n - n} = — -2 {r]iW ln + r] 2 W 2n + r] 3 W 3n } + W An + o p (1) , 



^k/^(nc/k) 1/a {a -I) 
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where W\ n , W2n and W^ n are those of Theorem \2.1\ and 



W 4n : 



jl /n B n {l-s)dQ{l-s) 



\fhjn (nc/k) 1 



Theorem 2.4 Under the assumptions of Theorem \2.1\ we have 



{fi n - //} 4 N (0, a 2 (a 



(3)) , as n 00, 



(2.24) 



yjk/n (nc/k) 



where 



a 2 



a 2 (3 A 2 2af3 2 



(2.25) 



(a - l) 4 (a - f3f 2- a ( Q - l) 2 ( a - f3) 



The following corollary to Theorem 12 .41 provides a straightforward practical way to build 
confidence intervals for /i. 

Corollary 2.1 Under the assumptions of Theorem \2.1\ we have 



where a, (3 and c are the estimates of a, (3 and c given in M.12\) and U.15\) respectively. 

3 Illustrative simulation study 

Let denote (1 — £/2)-quantile of the standard normal r.v. Given a realization (x%, x n ) 
of (Xi, ...,X n ) from a populationX satisfying the required assumptions, we construct a 
(1 — C/2) 100% confidence interval for /i via the following four steps: 

Step 1: Applying Reiss and Thomas algorithm (see subsection IA.2I of the Appendix), we 
select the optimal sample fraction k*. 

Step 2: Resolving the system fll . 12j) with k = k*, we obtain estimate values for a and /3 
that we respectively denote by a* and (3*. Then, we use the first equation of fll . 15)) to get 
the corresponding estimate c* of c. 

Step 3: Using formulas fll . 1 T)) and (|2.25p . we compute //* = jl{k*) and a (a*, (3*) respec- 
tively. 



{p, n — //}—>■ J\f (0, 1) , as n — >■ 00, 
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Step 4: Finally, Corollary 12 . 1 1 yields the (1 — C/2) 100% confidence interval for // : 

y/¥fca{a*,F){nc*/k*) lla * 

" ±2c 

Our simulation study, which is based on 200 samples of various sizes from the Frechet 
distribution (I2.22p with two distinct tail index values a = 1.5 and 1.7, consists of three 
parts. First, we compare, in terms of bias and root of the mean squared error (RMSE), the 
performances of the new estimator // n and Peng's estimator /i^ 3 . The results of this part 
are summarized in Tables [3TT1 and [3721 Second, we check the asymptotic normality of both 
estimators via four of the most popular goodness-of-fit tests at the 5% significance level: 
Cramer- von Mises (CvM), Kolmogrov-Smirnov (KS), Shapiro- Wilk (SW) and Pearson 
(P). The results of this part are summarized in Tables 13.31 and 13.41 and illustrated by 
Figures 13.11 and 13.21 Finally, we investigate the accuracy of the confidence intervals, 
built from the new estimator ju n , by computing their lengths and coverage probabilities 
(denoted by 'covpr'). The results of this part are summarized in Table 13751 (where 'lcb' 
and 'ucb' respectively stand for the lower and upper confidence bounds) and illustrated 
by Figure 13 .31 



Sample size 


500 


1000 


2000 


3000 


500 


1000 


2000 


3000 


Estimated value 


3.089 


2.999 


2.912 


2.467 


3.367 


3.119 


3.119 


2.289 


Bias 


0.411 


0.321 


0.234 


0.204 


0.689 


0.441 


0.441 


0.389 


RMSE 


0.400 


0.286 


0.125 


0.108 


0.674 


0.268 


0.268 


0.198 



Table 3.1: Point estimation of the mean based on 200 samples from the Frechet population 
with shape parameter a = 1.5. The true value of the mean is 2.678. 



Tables [3~7T1 and I3~721 show that, regardless of the sample size, our new estimator performs 
better than Peng's one as far as the bias and RMSE are concerned. Moreover, from 
the left panels of Figure 13.21 and Table 13.41 (corresponding to the case of the lighter tail 
a = 1.7) we see that the normality of ju n cannot be rejected by any of the tests when the 
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Sample size 500 1000 2000 3000 500 1000 2000 3000 



Estimated value 2.536 2.440 2.354 
Bias 0.383 0.287 0.201 

RMSE 0.340 0.211 0.114 



2.254 2.765 2.731 2.566 2.460 
0.101 0.612 0.578 0.413 0.307 
0.089 0.659 0.420 0.250 0.144 



Table 3.2: Point estimation of the mean based on 200 samples from the Frechet population 
with shape parameter a = 1.7. The true value of the mean is 2.153. 

sample size exceeds 100, while the right panels of Figure I3T21 and Table l3~4l show that the 
normality of //^ is rejected for sample sizes ranging between 100 and 200. In the case of 
the heavier tail a = 1.5, the right panels of Figure I3TT1 and Table I3T31 show that the sample 
size needs to be larger then 400 for the estimator /i^" to pass the normality tests, while 
the left panels of Figure 13.11 and Table 13.31 indicate that the normality of /2 n is accepted 
even for sample sizes smaller than 200. 



4 Proofs 



4.1 Proof of Theorem [2^1 



First recall that A = ft/a > 1. Then, from expansion (4.1) in Li et al. (2010), we have, 
as n — > oo 



a — a = a 



{tji (-Si - 1) + (S 2 - X' 1 ) + 773 (5-3 - A" 2 ) } + o p (AT 1 / 2 ) , (4.26) 



where 



and 



Sl '= TZ^ log ~y ' S2 := k 1^ ~Y ~ 

1 n—k,n \ 1 n—k,n 



i=l 



i=i 



k /yr V 1— A yr 

J- n — 7-t-l n. \ . J r 



1-A 



- n— i+l,n 

i=i 



log 



n— i+l,n 



Y, 



n—km 



(4.27) 



(4.28) 
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Sample size 


CvM 


KS 


SW 


P 


Sample size 


CvM 


KS 


SW 


P 


100 


0.447 


0.401 


0.198 


0.165 


100 


0.001 


0.000 


0.000 


0.002 


200 


0.729 


0.626 


0.795 


0.549 


200 


0.009 


0.006 


0.024 


0.104 


400 


0.267 


0.256 


0.347 


0.331 


400 


0.495 


0.446 


0.377 


0.500 


500 


0.306 


0.354 


0.410 


0.302 


500 


0.209 


0.273 


0.158 


0.329 


800 


0.374 


0.396 


0.412 


0.486 


800 


0.419 


0.321 


0.378 


0.344 


1000 


0.738 


0.706 


0.691 


0.722 


1000 


0.724 


0.711 


0.590 


0.733 



Table 3.3: Empirical p-values of normality tests for the new estimator (left panel) and 
Peng's estimator (right panel) based on 200 samples from a Frechet population with shape 
parameter a = 1.5. 

with Yi >n < ... < Y n>n being the order statistics pertaining to a sample Yi, ...,Y n of i.i.d. 
r.v.'s, defined on the same probability space as the X[s, with cdf 

G(y) = l-y-\ fory>l. (4.29) 



Csorgo et al. (1986) have constructed a probability space (Q, A,P) carrying an infinite se- 
quence Ui, U2, ••• of independent (0, 1) —uniform r.v.'s and a sequence of Brownian bridges 
{B n (s) , < s < 1} , n = 1, 2, having, amongst others, the property stated in Lemma 
14.11 Let Ui >n < ... < U n>n denote the order statistics pertaining to Ui,...,U n and define 
the empirical quantile function V n (s) as 

Vn(s) = U i)n for (i - 1) jn < s < i/n, i = l,...,n, and V n (0) = C/ lin . 



Lemma 4.1 On the probability space of Csorgo et al. (1986), for every < r < 1/2, we 
have, as n — > 00 

sup — —j^- t = O p {n ). (4.30) 

l/n<s<l-l/n (s(l — s)) 



Proof. See the proof of Theorem 2.1 in Csorgo et al. (1986) □ 
Without loss of generality, we assume that 

Yi — G" 1 (Ui) = (1 — Ui)" 1 , i = l,...,n, 



13 



Sample size CvM 


KS 


SW 


P 


Sample size 


CvM 


KS 


SW 


P 


100 


0.153 


0.076 


0.107 


0.364 


100 


0.013 


0.014 


0.004 


0.010 


200 


0.220 


0.143 


0.288 


0.249 


200 


0.278 


0.260 


0.248 


0.216 


400 


0.511 


0.515 


0.397 


0.524 


400 


0.392 


0.380 


0.298 


0.349 


500 


0.713 


0.781 


0.624 


0.635 


500 


0.520 


0.492 


0.480 


0.528 


800 


0.362 


0.311 


0.261 


0.458 


800 


0.619 


0.665 


0.408 


0.518 


1000 


0.778 


0.783 


0.645 


0.601 


1000 


0.720 


0.688 


0.485 


0.567 



Table 3.4: Empirical p-values of normality tests for the new estimator (left panel) and 
Peng's estimator (right panel) based on 200 samples from a Frechet population with shape 
parameter a = 1.7. 



and 



Yi, n = G- 1 (Ui, n ) = (1 - U^y 1 , i = 1, ...,n, 



where G 1 denotes the quantile function pertaining to cdf G given by formula (14.29)) . Then, 
this allows us to write 

Y n -i+i,n = (1 - V n (1 - > f° r " < s < -, i = 1, n. 

n n 

Making use of the previous representation of the order statistics Ki_i+i,n, we m &y rewrite 
the three statistics in (14.27)) and (I4.28j) into 

Sl = - [ k/n log - 1 ds 

k Jq \ 1 — U n -k,n ) 

s, = - [ k/n p-K(i-^) y 1+A ^ 

k Jq \ 1 — U n ^k,n 



and 

S 3 = Vl f h/n A-K(i-^) V 1+A log f i-v n (i-a " 1 

^ JO V 1 — U n _k,n J \ ^ — U n -k,n 

Next, we show that, as n — > oo 



Vk(S 1 -l) = W ln + o p (l) 
Vk (S 2 - A" 1 ) = W 2n + o p (1) 
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n 


Icb 


v> 


ucb 


covpr 


length 


n 


Icb 


V> 


ucb 


covpr 


length 


100 


-0.239 


2.992 


3.793 


0.666 


4.032 


100 


-0.524 


2.823 


3.281 


0.566 


3.805 


200 


0.971 


3.071 


3.623 


0.445 


2.652 


200 


-0.283 


2.794 


3.257 


0.681 


3.540 


400 


1.485 


3.061 


3.602 


0.603 


2.117 


400 


0.760 


2.766 


3.197 


0.700 


2.437 


500 


1.497 


3.026 


3.565 


0.666 


2.068 


500 


0.770 


2.544 


3.154 


0.733 


2.384 


800 


1.708 


2.943 


3.403 


0.785 


1.695 


800 


0.712 


2.471 


3.013 


0.800 


2.301 


1000 


1.922 


2.883 


3.302 


0.795 


1.380 


1000 


1.061 


2.451 


2.492 


0.833 


1.431 



Table 3.5: Accuracy of 95% confidence intervals for the new estimator based on 200 
samples from Frechet populations with shape parameters a = 1.5 (left panel) and a = 1.7 
(right panel). 



and 

Vk(S 3 -X' 2 ) =W 3n + o p (l), 

where W / i n ,H / 2n and W 3n are the Gaussian r.v.'s defined in Theorem 12.11 We will only 
consider the asymptotic distribution of S 3 . The proofs for Si and S 2 use similar arguments. 



By letting / (x) 



x 



A-l 



logo;, the statistic S 3 becomes 



S 3 = - (n/k) / / — 

JO V 1 — u n-k,n 



ds. 



An application of standard calculus gives L f (s) ds = —X 2 . Therefore 



S 3 - A 



-2 



(n/k) 



k/n 



f 



1 ~ Vn (1 - 
1 U n — u r 



f 



k/n J 



ds. 



Let us follow similar techniques as those used in the proof of Lemma 9 in Csorgo et al. (1985) 
We divide the integral above in two parts, then we study the asymptotic behavior of each 
integral. Observe that 

• l/n 



s 3 - X 



(n/k) 



fl 1 -V n C.\ -.s; 



1 - U n - 



n—k,n 



f 



k/n 



ds 



(n/k) 



k/n 



l/n 



J- ^n—k,n 



k/n 



ds 
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Figure 3.1: Empirical p- values of normality tests for the new estimator (left panel) and 
Peng's estimator (right panel) based on 200 samples of a Frechet population with shape 
parameter a = 1.5. 

Next, we show that \/kA n converges to in probability. Indeed, we have 1 — V n (1 — s) = 
1 — U n>ni for < s < 1/n, it follows that 



A n = (n/k) 



l/n 







./ 



1 " U r , 



1 - u, 



n—k,n 



f 



k/n 



ds 



1 ~ Un,n 
1 U n —k,n 
l/k 



l/k 



f (s) ds. 



nL/K 

An elementary calculation gives / / (s) ds = A _1 A;~ A (log A; -1 — A -1 ) , and from Lemma 

Jo 



2.2.3 of page 41 in de Haan and Ferreria, (2006), we have n (1 — U n ,n-k) /k — > 1, as n — >■ 
oo, therefore 

A n = {1 + o p (1)} k~ x log k- 1 - \- l k- x (log k- 1 - X- 1 ) . 
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Figure 3.2: Empirical p-values of normality tests for the new estimator (left panel) and 
Peng's estimator (right panel) based on 200 samples from a Frechet population with shape 
parameter a = 1.7. 
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Confidence bounds 
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Figure 3.3: Confidence intervals for the mean \i based on 200 samples of size n, ranging 
from 100 to 2000, from a Frechet population with shape parameter a = 1.5. The horizontal 
line represents the true value \i = 2.678. 
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Since A > 1 and k ->■ oo, then k~ x+1/2 ->■ and £r A+1 / 2 log fc 1 ->■ 0, it follows that 
VkA n 4 as n — > oo. Consider now the second term Vt n which may be rewritten into 



Vt n = (n/k) 



k/n 



l/n 



J- _ Un-k, 
/•k/n 



i - a 



n—k,n 



ds 



+ (n/k) 



l/n 



f 



1 — U n -h: 



~f 



k/n 



ds 



Q n i + Q n 2- 



Making use of Taylor's expansion of /, we get 



1 - U, 



n—k,n 



1 - u, 



and 



./ 



n—k,n 



~f 



f 



tPn(s) \ (I - V n (1 - s) 



1 - u, 



n—k,n 



i - a 



n—k,n 



1 - u, 



n—k,n 



where 



and 



l-U n - k , n J J \k/nj \l-U n - k ,n k/n 

min {1 - V n (1 - s) , s} < (p n (s) < max {1 -V n (l-s), s} 



mm 



<ip n < max 



1 - U n -k,n ' k/n J ' 1 1 - U n -k,n ' A;/n J ' 

Observe now that Q n i and Q n2 may be rewritten into 

"l-K(l-s) s 



(4.31) 
(4.32) 



Hnl = ("A) / fc/n f f T—ff ) 

J l/n XL — Un-k^J 



1 - u, 



n—k,n 



1 — U n -h: 



and 



where 



^ = {1 + <*(!)> < n /*)£V(j^) 



1 - XJ n -k,n k/n 



ds + tt* n2 , 



Ki ■= (n/k) 



k/n 



l/n 



1 U n —k t n J \ 1 U n —k : n 

l-K(l-s) s 



and 



fi; 2 ;= {1 + 0,(1)} (n/k) 



x 

k/n 
l/n 



1 - 17, 



n—k,n 



1 - C/, 



n—k,n 



ds, 



'f(&)-f' ' 



s 



X 



l-C/ n -fe,n fc/n. 



(is. 
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From Lemma fA. 2 1 (see the Appendix), both ykQ^i an d VkVl^ converge to in probability. 
Since n (1 — U n ^k,n) /k — > 1, then 

rk/n 
'1/r. 



0«i = {1 + o p (1)} ^ V [1 - s - K (1 - s)] rfs + o p {k- 1 ' 2 ) 



and 



fin 2 = - {1 + O p (1)} ^-(l-^-M) f ^ f • ) f ( • ) ds + Qp {k -l/2) 



(k/n) Ji/n \k/n) \k/n 

The derivative of function / equals /' (x) = (A — l)x A ~ 2 logx + x A ~ 2 , then 

Q nl = (A - 1) (n/k) [ t x ~ 2 (logt) [1 - V n (1 - fct/n) - fct/n] dt 
Ji/k 

+ (n/Jfe) / t x - 2 [1 - K. (1 - kt/n) - kt/n] dt + o p (k~ 1/2 ) 

Jl/k 



and 



^n2 = (A - 1) (n/k) [k/n - (1 - U n . k , n )] [ t^ 1 \ogtdt 

Jl/k 



'l/k 

+ (n/k) [k/n - (1 - C/ n _ fc , n )] / t A -Mt + o p (k- 1 ' 2 ) 

Jl/k 



= X- 2 (n/k) [k/n - (1 - U n ^ n )\ + o p (A;- 1 / 2 ) . 
Fix < r < 1/2, then using approximation (I4.30p . in Lemma [4.11 yields 

Vk~Q nl = (A - 1) ^n~Jk / ^ A " 2 (logt) B n (1 - fct/n) tft 

Ji/k 



+ v^A / t A_2 ^n (1 - kt/n) dt + Vk~n nl (t) + o p (1) 
Ji/k 



'l/k 

and 

Vk~n n2 = -\- 2 \fn~/kB n (1 - k/n) + \fkTl n2 (r) + o p (1) 

where 

-i 



VkQ nl (r) = (A - 1) O p (n- T ) (k/n) 1/2 ~ T (n/k) 1/2 / ^- 2 +(V2-r) | logt | df 

Jo 

+ O p (n- T ) (k/n) 1/2 - T Jn~Jk f ' t x ~ 2+ W 2 ^dt, 
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and 

Vkn n2 (r) = \- 2 p (n- T ) y^/k (k/n) 1/2 - T . 

ForA>l, / t x - 2+ ^ 2 - T ^\\ogt\dt= (A- 1/2- r)" 2 and f ^-W-*) dt = (A - 1/2 - r) _1 

Jo _ Jo 

are finite integrals. Then both quantities vM^ni and y/kil n 2 are equal to O p (/c _r ) for all 

large n, which tends in probability to as n — > oo. Recall that up to now we have showed 

that 

VkQ nl = (A - 1) y^Jk [ t x - 2 (logt) £ n (1 - fct/n) tit 

+ \fnjk [ t x ~ 2 B n (l-kt/n)dt + o p (l), 
Ji/k 

and 

Vk~{l n2 = \- 2 ^JkB n (1 - k/n) + o p (1) . 
It remains to prove that 

, rV* 

J n := (A-l)V^A/ ^(bg^^^-ifet/n)^ 
Jo 

. /■!/* 

+ a/™A / £ A ~ 2 .B n (1 - fct/n) (it, 



o 



converges, in probability, to 0. Indeed, since E \B n (1 — ks/n)\ < ^ks/n, then 

-l/k 



E |/ n | < (A - 1) / t A - 2+1 /2 (| logt | + !) dt 
Jo 



Since 

/•V* 2 

/ t A - 2 +V2 (| logt | + i) rft = _ k- x +2 (2A - log k + 2A log fc + 1) , 

Jo (2A — 1) 

which tends to as n — > oo, then I n converges to in probability. This completes the 
proof of Theorem 12. 1[ □ 



4.2 Proof of Theorem I2T2 



To establish the asymptotic normality of a, given in (12.231) , we proceed by similar argu- 
ments as for ju n in the proof of Theorem 12.41 □ 
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4.3 Proof of Theorem 

Let us divide the integral (11.21) . in two parts, as follows: 

A* = j"i,fi (k) + fi 2 ,n (k) , 

where 



"k/n 



rk/n pi 

l Q (1 — s) ds and /i 2 , n (k) '■= / Q(l — s)ds. 

JO Jk/n 



' k/r 

Recall that, in Section 1 formula (11.161) . we have defined estimator ju n of u by 

V-n = fil,n (k) + fh, n (k) , 

where 

//i,n (&) := (fc/n) nc/fc 1/Q ^ ^— + ^- 

[ a - 1 /3 _ i 

1 n 

n (k) := — / J ^n—i+l.n- 

To simplify notations, let us set 



and 



n 

i=k+l 



m- W , n ^ n W ~ ^> n ^ ' 1 = lj 2 - 
y K/n (nc/k) 

First, we consider Z ni . It is easy to verify that, as n -> oo 

u x , n (*) = {1 + o p (1)} - (nc/A;) 1/a -Ar, 

n a — 1 

and, under the condition (11. 6ft . we have 

ft,„ (*) = {1 + o p (1)} - A- (nc/k) 1/S . 

n a — 1 

It follows that 

#i,n(fc) ~ Vi,n{k) = {1 + o p (l)}- { A , (ric/k) 1/a A (nc/ft) 1/Q 

n ^ a — 1 a — 1 

Let us write Z n \ = T Xn + T 2n , where 



r lB := {1 + o p (1)} { A-r " - At) 

[a — 1 a — 1 ) 
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and 



T 2n :={l + o p (l)}Vk 



(ric/ k) 
(nc/k) 



1/2 
TJa 



We begin by showing that T 2 „ — > 0, as n — )■ oo. First observe that T 2n may be rewritten 
into 

T 2n = {l + o p (1)} Vk{(nc/k) 1/a ~ 1/a (c/c) 1/S ~ 1/a - l} . 
Assumptions (z) and (ii) of Theorem 12.11 imply that k 1 ! 2 j log (n/fc) — » oo. Also, from The- 



orem 1 of Peng and Qi, (2004) , the asymptotic normality of a gives a — a = O p (k 1 ^ 2 ) 



Therefore (1/a— l/a)\og(nc/k) —> 0, this implies that (nc/k) 



l/S-l/a p 



— >• 1, as n — )■ oo. 



On the other hand, from equation (4.7) in Li et al. (2010) , we have 



c/c — 1 = oT x (a — a) log — + o p ( fc _1//2 log • 

th V 

Since c~is a consistent estimator of c, then Taylor's expansion gives 

(c/c) 1 ^* -1 '" — 1 = a^ 1 (1 + o p (1)) (a — a) (c/c — 1) , as n — > oo. 
It suffices now to show that Vk ( (c/ c) l ^ a ~ l ^ a — 1) converges to in probability. Indeed, 



again by using the fact that a — a = O p (k x / 2 ) , yields 

Vk ((c/c) 1/a - 1/Q -l)=O p (1) L-W log £ + o p ( 



Vk 



which tends in probability to 0, because we already have \fk/ log (n/k) — > oo. Now, we 
consider the term T ln . Since S is a consistent estimator of a, then it is easy to show that 

1 + 0,(1) 



in 



(a-V 



-\fk (a — a) , as n — > oo. 



From [2TT| we infer that 
T - ( 

-Lin — 

It follows that 

Z n \ = 



1 + o p ( 1 ) ) a ^ iWin + + ^3^) ; as n ^ oo. 

(a-1) 



a 



( a - iy 



{ViW ln + v 2 W 2n + ri 3 W 3n } + o p (1) . 



(4.34) 
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Let us now consider the asymptotic distribution of Z n2 , in (I4.33p . It is shown in Csorgo and Mason, (1985) 



or more recently in Necir and Meraghni, (2009) that 

H /n B n {l-s)dQ{l-s) 



Jn2 



+ O p (1) . 

l/a 



y/k/nQ(l - k/n) 

On the other hand, from (11. 101) . we have Q (1 — k/n) ~ (nc/k) L/a , as n — > oo, it follows 
that 

Z n2 = W 4n + o p (1) . (4.35) 
Combining ( I4.34p and (I4.35P achieves the proof of Theorem 12.31 □ 



4.4 Proof of Theorem 12.4 



Now, we investigate the asymptotic normality of /i n given in( l2.24j) . Since 
are sequences of centred Gaussian r.v.'s, then 

7^— 17^{£n-/i} 4 M (0,rEr*), asn^oc, 



y/k/n (nc/k) 



where 



r 



a 



a 



a 



-T^VU — 772^2,- — 

1) (a — 1) (a 



r??3, 1 



( < i — I ) ( ( \ — I ) ( n — 1 

r* is the transpose of V and E is the variance-covariance matrix of the vector (Wi n , W, 
defined by 

1 J_ _ I A_J_ _i 

1 



An 



E 



i i 

A 2 X 



1 _ 1 

2A-1 A 2 " 



1 A-l 



(2A-1) 

_2 1_ 1 1_ 2 1_ _J_ 

X 3 A 2 (2A-1) 2 A 3 (2A-1) 3 A 4 A 2 

_i A-l _J_ 2 

A A 2 2-a J 



Note that the elements of were obtained after tedious computations of the limits of the 
expectations E [Wi n Wj n ] for i, j — 1,4 (i < j). Analogue calculus of these quantities may 



be found in Peng, (2001) , Necir and Meraghni, (2009) and Necir and Meraghni, (2010) 
Finally, a standard calculation of the product TEr* yields 



a 2 /? 4 



rsp 



(a - l) 4 (a - f3f 2 



+ 



2a/3 2 



a 



(a-l) 2 (a-/3) 2 ' 



which is denoted by o (a, (3) . This completes the proof of Theorem 12.41 



□ 
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5 Concluding notes 



The main objective of this paper was to propose a bias- reduced estimator for the mean 
of a heavy-tailed distribution. This was achieved on the basis of the bias-reduction of the 
first and second order parameter estimators of regularly varying distributions developed 



by Peng and Qi, (2004) and the corresponding high quantiles estimators introduced by 



Li et al. (2010) In addition, the newly introduced estimator is asymptotically normal, 



making confidence intervals easily constructible. We conclude by simulation that, com- 
pared to that of Peng, our new estimator has smaller bias and RMSE and consequently 
it performs better. 
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A Appendix 

A.l Auxiliary results 

Lemma A.l Let k = k n be a sequence of integers satisfying U.6\) and f (x) = x x ~ l logx, 
A > 1. Then, uniformly on s £ [1/n, k/n] , we have 
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Proof. We have n (1 — U n ^k,n) /k A 0, as n — > oo, then 



r 



1 - u, 



n—k,n 



r 



i - a 



n—k,n 



;i + 0,(1)) 



k/n 



k/n J 



A straightforward calculation of the derivative of / yields 



k/n 



k/n J \k/n 



A-2 



(A-l) 



A-2 



1 log 



<*Pn (s) 

k/n 



+ (A-l)log^£2+'^ 



A-2 



;i.36) 



Observe now, that inequalities (I4.3ip imply 



mm < -, > < L — — - 1 < max < -, 



From Wellner, (1978), we have 



\l-s-V n (l-s)\ P 
sup > as n — > oo, 

l/n<s<\ s 



it follows that 



sup 

l/n<s<k/n 



¥n (s) 



— > as n — > oo. 



;i.37) 



On the other hand, in view Lemma 3 in Necir and Meraghni, (2009) , we infer that 



sup 

l/n<s<k/n 



- 1 



— > as n — > oo. 



(1.38) 



By applying the mean value theorem to the functions x — > log x and x — > x x 1 respectively, 
then by using (I1.37P and (11.38p . we show readily that, as n — > oo 

A-l 
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l/n<s<k/n 



log 



<^n (s) 
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l/n<s<k/n 



1 



4 0. 
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Note that the first result of (I1.39P implies that 



sup 

l/n<s<k/n 



log — log ■ 



— >• 0, as n — >• oo. 



1.40) 



k/n k/n 

By using equations ( 11.39P and f ll.40p together, we show that, uniformly in s G [1/n, k/n 



the right-hand side of equation (I1.36P , is equal to o p (1) 



A-2 



log 



k/n) k/n 
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Lemma A. 2 We have \fk£l* nl A and VkQ* 9 A 0, as n — »■ oo. 



n2 ^ w ) 



Proof. We only show the first result. The second one is obtained by similar arguments. 
Recall that 



Ki ■= ~ (n/k) 
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1 U n —k,n J \ 1 U n — 
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Using Lemma [A. 11 we get, as n — > oo 
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By a change of variables, we get 



log 



1 Un—k,n 



ds. 



k/n 



l-K(i-s) 



A;/n fc/n 



(is. 
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n r ) ) ds 



Making use of approximation( H~30l) yields 

Vkn* nl = o p (1) (n/k) 1/2 C s x ~ 2 |logs| (\B n (1 - ks/n)\ + (ks/n) 1/2 ~ T O p ( 
In other words 

y/kQ* nl = o p (l) {n/k) 1/2 [ s x ~ 2 |logs| \B n (1 - fcs/n)| ds. 

Ji/k 

+ o p (k' T ) [ S x - Z,2 - T |log s\ds. 
Jl/k 

The expectation of the first term of right-hand side of the previous equation is less than 
or equal to 

o p {l){n/k) l/2 [ s x - 2 \logs\B[\B n (l-ks/n)\]ds. 
Jo 

Using the fact that E \B n (1 — ks/n)\ < (ks/n) 1 ^ 2 , we show that the previous quantity 
is less than or equal to o p (1) / s A ~ 3 / 2 |log s\ ds. Since both integrals / s x -?>/2-t |i Q g S | ds 



•i 



JO JO 

and / s A - 3 / 2 - T | i g s | d s are finite, then \fkVl* nl = o p (1) , as n — > oo. 
Jo 
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A. 2 Optimal choice of the sample fraction k 



Reiss and Thomas in Reiss and Thomas, (2007) , page 137, proposed a heuristic method 



for choosing the optimal number of upper extremes used in the computation of the tail 
index estimate. In this paper, we adopt this algorithm by making use of Peng and Qi 
estimator a = a(k) which is defined by the system of two equations (j!.12p . By this 
methodology, one defines the optimal sample fraction of upper order statistics k* by 

1 k 

k* := argmin — '^^i 6 \oe (i) — median {a (1) , a (k)}\ , 

8=1 

with suitable constant < 9 < 1/2. The quantity a(i) corresponds to Peng and Qi 
estimator of the shape parameter a, based on the i upper order statistics. On the light of 
our simulation study, we obtained reasonable results by choosing 9 = 0.3. The same value 



for 9 has also been observed by Neves and Fraga Alves, (2004) when employing Hill's 



estimator. The software programs of this methodology are incorporated in the "Xtremes" 



package accompanying the book of Reiss and Thomas, (2007) 
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